Integrating temporal and spatial scales: Human 

structural network motifs across age and 

region-of-interest size 

Q , Christoph Echtermeyer^'^, Cheol E. Han^, Anna Rotarska-Jagiela 

CN ; Harald Mohr^, Peter J. Uhlhaas^, Marcus Kaiser^'^'^'* 

July 21, 2011 

O 
(N 






^Institute for Chemistry and Bfology of the Marine Environment, Carl von Ossi- 

etzky University, Oldenburg, Germany 

•^School of Computing Science, Claremont Tower, Newcastle University, Newcastle- 
upon-Tyne NEl 7RU, UK 

•^Department of Brain and Cognitive Sciences, Seoul National University, Seoul 

151-746, Repubhc of Korea 
^ . ^Department of Neurophysiology, Max-Planck Institute of Brain Research, Frank- 

O ' furt a. M., Germany 

'c/3 . '^Goethe University, Institute of Psychology, Frankfurt a. M., Germany 

^~i' ^Institute of Neuroscience, The Medical School, Framlington Place, Newcastle 

^ ! University, Newcastle-upon-Tyne NE2 4IIII, UK 

*E-mail: m.kaiser@newcastle.ac.uk 

Running title: Human structural connectivity patterns across time and space 



Ph. 



0^ 
rn 

O 



X 



Abstract 

Human brain networks can be characterized at different temporal or 
spatial scales given by the age of the subject or the spatial resolution of 
the neuroimaging method. Integration of data across scales can only be 
successful if the combined networks show a similar architecture. One way 
to compare networks is to look at spatial features, based on fibre length, 
and topological features of individual nodes where outlier nodes form sin- 
gle node motifs whose frequency yields a fingerprint of the network. Here, 
we observe how characteristic single node motifs change over age (12-23 
years) and network size (414, 813, and 1615 nodes) for diffusion tensor 
imaging (DTI) structural connectivity in healthy human subjects. First, 
we find the number and diversity of motifs in a network to be strongly 
correlated. Second, comparing different scales, the number and diversity 
of motifs varied across the temporal (subject age) and spatial (network 
resolution) scale: certain motifs might only occur at one spatial scale or 
for a certain age range. Third, regions of interest which show one motif at 
a lower resolution may show a range of motifs at a higher resolution which 
may or may not include the original motif at the lower resolution. There- 
fore, both the type and localisation of motifs differ for different spatial 
resolutions. Our results also indicate that spatial resolution has a higher 
effect on topological measures whereas spatial measures, based on fibre 
lengths, remain more comparable between resolutions. Therefore, spatial 
resolution is crucial when comparing characteristic node fingerprints given 
by topological and spatial network features. As node motifs are based on 
topological and spatial properties of brain connectivity networks, these 
conclusions are also relevant to other studies using connectome analysis. 
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1 Introduction 



The s et of connections in the brain can be described as the connectome (jSporns et ah 



20051 ). Connectome data is currently becoming available at different levels of 
the structural organization: from neuronal networks of connections between 
neurons to fibre tract networks between brain regions. These data promise 
to give valuable new insights, but analyses integrating different data sets are 
challenging. One problem of data integration across labs is that the raw data 
might show different voxel resolutions due to different magnetic field strengths 
of the MRI (magnetic resonance imaging) scanner. In addition, different labs 
might use different parcellation routines or brain atlases. This leads to dif- 
ferent spatial resolutions and consequently to different s tructural connectiv- 
ity networks (eventually with different numbers of nodes) ( Zalesky et al.l . uOlOt 



, y 

Bassett et all . I2OIOI: iFornito et al.l . l2010t iHavasaka and Laurienti l2010t ). An- 
other problem of data integration is caused by age differences between subjects. 
There is a certain evidence that characteristics of the connectome vary over ag e 
(JFan et all . I2OIOI: iHagmann erall . l2010t iFair et al.l l2009t lUhlhaas et all . l2009l) . 
Given a network analysis result of two studies, say concerning small- world fea- 
tures (jWatts and Strogata . 119981) , are differences due to the study population's 
differences in connectivity or are they due to differences in age or network reso- 
lution? In this article, we observe how topological and spatial network features 



change across age and network resolution for structural connectivity, based on 
diffusion tensor imaging (DTI), in healthy human subjects. Using single node 
motifs, we also apply a network mapping technique, which yields a compact and 
easily comparable representation of complex network structures. 

1.1 Scales of a hierarchical organisation 

Hierarchy is a central feature in the organisation of complex biol ogical systems 



and particularly the structure a nd function of neural networks (jKaiser et al. 
I2OI 



I particularly tne structure a na lunction or neural networKs (jivaiser et ai. . 
0|; iKaiser and Hilgetaei 120101) . In neural networks, the term hierarchy may 



be understood in several different ways and can apply to topological, spatial, 
temporal as well as functional properties. These features can be observed at 
different scales or resolutions. 

Temporal scales 

Network analysis is often presented as the analysis of a network at a given 
time, but many networks change both over time and with temporal resolu- 
tion. Structural connectivity, given by anatomical connections betw een neurons 



or br ain regions, may change due to activity-dependent plasticity (jButz et al. 



20091 ) ■ Functional connectivity, given by correlations in the activity patterns of 



network nodes, can change for different tasks for a subject or due to different 
system states (e.g. awake vs. asleep). In our case, connectivity might also 
change over longer time-scales, as part of brain development. 

Recovered networks can appear differently depending on temporal resolu- 
tion. At a fine-grained temporal resolution of a few hundred milliseconds, 
cortical tissue can show dist inct oscillations, e.g. in the a, 7, or S frequency 



bands (|Roopun et al.l . 120081 ) . For more coarse-grained time-resolutions with 
sample lengths of several seconds or minutes such distinct frequency peaks dis- 
appear and_the frequency distribution displays a 1// power-law behaviour in- 



stead (JBuzsakil . |2006[) 



Here, we observe temporal scale in terms of snapshots of structural connec- 
tivity at different points in time. The human brain undergoes large structural 
developments until the age of around 20-25 years. These developments coin- 
cide with large functional changes during child development and during the 
teenager years. They further coincide with changes in functional connectivity 
(jUhlhaas et al.l . 120091) . Such changes during brain network development are 



not only of interest for the study o f healthy subjects, but als o for many psy- 
chiatric disorders like schizophrenia ( Uhlhaas and Singeij . 120101 ) , which have an 



onset around the time when the brain network matures (age 18-25 years). We 
therefore test the potential of single node motifs to integrate information across 
temporal network 'snapshots'. 

Spatial scales 

The network structure not only depends on the time of a 'snapshot' but also on 
the spatial resolution. One can distinguish the micro- and macro-connectome 
as the connectivity between individual neurons and connections between brain 
regions, respectively (|DeFelipa . i201Qi) . In this study of structural connectivity 



we focus on the macro-connectome level based on MRI data (see ISeuna ( 20091 ) 
for more information about neuronal connectivity instead). 

The macro-connectome can be analysed at different levels of spatial resolu- 
tion and thus different levels of brain network organisation. With a low spatial 
resolution, it corresponds to connectivity between brain areas where network 
nodes correspond to brain regions. The resulting networks s how features o f 



modular and small-world networks ([Hilgetag et al.l l2000t ISporns et al.l . I2OOOI ) 



and typically consist of up to 100 nodes per cortical hemisphere for the primate 
brain. At the mesoscale, connectivity between regions of interest of the same 
size, e.g. Icm^ cortical surface area, can b e studied. Such netwo rks consist of 
around 1,000 nodes for the human brain ( Haemann et al.L 120081) . With even 
higher resolution, the microscalc of the macro-connectome c ould be studied, 
i.e. connections between cortical columns (JMountcastlel 119971 ). However, such 
networks with nodes representing one cortical column each would consist of 
1,250,000 nodes for the human brain (both hemispheres, not including subcor- 
tical str uctures) and i dentifying columns with diamet ers ranging from 200fim 
to 1mm (JHubel et al.l . ll977tlHorton and Adamd . 120051 ) in humans is (currently) 
beyond the reach of standard MRI. Such extremely high resolutions might only 
be achieved with higher magnetic fields or extremely high magnetic fields in post 
mortem studies. 

We therefore study the global and regional scale of the macro-connectome. 
Importantly, this level allows us to assess whether motifs which are characteris- 
tic for a certain cortical region are also characteristic for all parts of that cortical 
area or whether a region displays a variety of motifs at higher spatial resolu- 
tions. Also, changes in spatial resolution affect spatial measurements, which 
form part of the motif detection. It is often interesting to observe how many 
connections go to nearby targets and how many extend over a long distance, 
potentially linking different components of the neural network. This can be 
readily observed using a histogram of the connection lengths of a network. It is 
known that the probabil ity that two neurons are connected decays almost expo - 
nentially with distance (JHellwid . I2OOOI : iKaiser et all 120091: ISchiiz et all l2005l) : 
connections over a long-distance are less likely than short-distance connections. 
In different organisms, ranging from neuronal connectivity in C. elegans and lay- 
ers in the rat visual cortex to fibre-tract connectivity in the macaque, the actual 
distribution w as approximated best by a Gamma probability density function 
( Kaiser et al.l . [2009). Looking at different spatial resolutions within humans, 
we would expect that the proportion of short-distance connections relative to 
long-distance connections increases when the spatial resolution is increased. We 
will therefore observe how the length distributions change over different spatial 
scales. 



1.2 Single node motifs 

Networks can be analysed following different concepts: To investigate a single 
network, for example, analysis can focus on its nodes, comparing them through 
connectional fingerprints based on the absol ute values of node p roperties (e.g. 
node degree or local clustering coefficient) (Passingham et al.l . 12002. ). On a more 
abstract level, nodes of brain networks can be regarded with respect to their dif- 
ferent organisation levels. These approaches look at one network of a particular 
subject, but networks can also be compared across different subjects — or differ- 



ent networks might exist for the same subject across time. However, mapping 
between networks becomes difficult if the number of nodes and edges varies, such 
that direct comparisons (i.e. link-to-link or node-to-node) between structures 
are ruled out. In contrast to such methods of cross-comparison, the approach 
used in this paper is less sensitive to these changes: Although a network with 
a slightly higher number of nodes could show additional motifs, most nodes are 
non-outliers making such changes less likely. Similarly, a network with slightly 
more edges might show absolute changes in network features without affecting 
the outliers of the feature distribution leading to similar single node motifs. 
Again, these comparisons of the number and classes of motifs become possible 
for non-identical numbers of nodes or edges but only if the numbers do not differ 
too much. These motif-classes are significantly different from each other and 
they allow to classify network nodes at the same level even if they appear in 
different networks. 

The concept of node-motifs — a combination of local network features — com- 
plements that of network-motifs, which are spec ific connect i vity p atterns that 
have been used to characterise networks before ( Milo et al.l . l2002r) . An exam- 



ple for a node motif are highly connected nodes or hubs that affect spreading 
phenomena and can be important components of a network. More complex, 
multi-dimensional node-motifs, which are characterised by multiple features in 
combination, specify nodes more comprehensively. With these more precise 
descriptions, new kinds of motifs can be formulated and ICosta et al.l ( 20091 ) 



presented a routine for their detection and specification. We proposed first im- 
provements and procedur es for automating parameter choices for this method 



(jEchtermever et al.l . 120111) . such that large numbers of networks can be pro- 
cessed and compared to each other. This enhanced technique is applied to 
brain networks in this paper. 

1.3 Comparing motif counts across networks 

The usefulness of mapping brain networks (which itself are abstract represen- 
tations) to motif-classes is demonstrated for two scenarios: First, regional con- 
nectomes of 53 children (aged 12-23 years) are compared to show how the 
fingerprint of a DTI network can change over time. Second, we test the effect 
of different connectome-resolutions on the resulting network-fingerprint. There- 
fore, DTI networks at different levels of organization were generated by changing 
the level of granularity from 414 regions of interest (ROIs) to up to 1,615 ROIs. 
Comparing networks from the same subject but at different granularities 
has so far been complicated by th e critical dependence of standard approaches 
on the resolution of the network ( Antiaueira et al.l . l2010l ). We show how our 



novel node fingerprints can be used to compare different levels of connectome 
organization. 

Note that comparing motifs has several advantages over comparing network 
features. Features, such as the small-world measures of clustering coefficient 
and characteristic path length, critically depend on the edge density of a net- 
work. Therefore, changes in these measures could be either due to changes in 
network organization (e.g. modularity) or due to a difference in the number of 
connections. To identify the changes related to network organization, the edge 
density needs to be constant when comparing networks. However, this leads to 
a severe problem: when transforming raw data of connection strengths between 



nodes to binary data, each network needs a different threshold for establish- 
ing a connection ( Rubinov and Spornsl . |2010|). To avoid the problem of either 



having different edge densities or different t hresholds, researchers increas ingly 
use weighted networks without thresholding (jBullinore and Bassettl . I2OIII) . For 



our analysis, we expect that differences in edge density are not only due to 
inter-subject variability, as in other studies, but are more heavily influenced 
by changes across different temporal or spatial scales. Therefore, we observe 
how network measures, and as a result node motifs, change over scales. These 
changes can be due to changes in organisation or edge density. 

Although a detailed analysis is outside the scope of this paper, we show 
the relation between both scales in Figure [TJ Whereas edge density decreases 
with network size, there are no significant changes between age groups when 
looking at networks with the same spatial resolution. However, we would like 
to point out that (a) motifs change over time and can therefore be used to 
find characteristic changes in the developing brain and (b) motifs change over 
spatial scales, therefore comparisons between studies are only meaningful when 
the same (or a comparable) spatial resolution has been used. 

2 Materials and Methods 

2.1 Recruiting and Data Recording 

We recruited 53 participants (age between 12 and 23 years) from local commu- 
nity and high schools. The study received ethics committee approval and written 
informed consents were obtained from the participants' parents for under-age 
participants, or from themselves if older than 18 years after study procedures 
were described. Participants were screened for a history of psychiatric and neu- 
rological disorders and current drug abuse using ophthalmological assessment 
including monocular and binocular visual acuity. Participants were grouped 
into four age-categories (Table [1]). 

A 3 Tesla scanner (Siemens Trio, Erlangen, Germany) was used at the Brain 
Imaging Centre (BIC), Frankfurt am Main, Germany. We recorded Tl weighted 
MRI (voxel-size Ixlxlmm'^) and Diffusion Tensor MRI (voxel-size 2x2x2mm'^). 
To minimize head motion, we used tightly padded clamps on the eight-channel 
head coil. The Tl weighted MRI was recorded with the following parameters: 
176 slices. Field of View (FoV) 256mm, repetition time (TR) 2250ms, and echo 
time (TE) 2.6ms. Three Diffusion Tensor Images per participant were recorded 
with the following parameters: 60 slices, FoV 192mm, TR 7600ms, TE 91ms, 
60 directions b- vectors with b-factor of 1000 and 10 bO images. 

2.2 Data Processing and Network Construction 

We used freesurfer to obtain surface meshes of the boundary between grey mat- 
ter and white matter from Tl anatomical brain images (http : //surf er . nmr .mgh . harvard . edu ) 
and Diffusion toolkit along with Track Vis (W ang et al.l . 120071 ) to obtain stream- 
line tractography from diffusion tensor images. We registered surface meshes 
into brains of the diffusion tensor images to extract networks. Freesurfer pro- 
vides parcellation of anato mical regions of cort i ces (33 for each he misphere) 



based on the Deskian atlas ( Desikan et al.l l2006t iFischl et al.l l2004bl ) and sub- 



cortices (jFischl et al.l . l2002ll2004a) . We subdivided those anatomical regions of 
cortices into many of similar surface area using our own algorithm. It is known 
that the size of each region of interest (R OI) aff ects network connectiv ity and 



various network measures (jZaleskv et all boiol : iHaemann et all . 120071 ). The 



core of our algorithm is an expectation-maximization (EM) algorithm. First, 
we choose the expected number of subdivisions (400, 800, and 1600) for cor- 
tical regions (we do not subdivide subcortical areas) and decided the number 
of subdivisions based on the ratio of each surface area over whole surface area 
in the average subject. Then we sub-divide each anatomical cortical region 
to the assigned number of subdivisions using the EM algorithm, minimizing 
variance of subdivisions' surface area sizes. The ratio of variance to mean 
does not exceed 10% in the average subject. In each subject, this ratio was 
higher (400: 414.78mm2 ± 140.06, 33.77%; 799: 207.65mm2 ± 77.51, 37.33%; 
1601: 103.63mm2 ± 44.33, 42.78%, statistics of cortical ROIs only) because the 
size of each subject's anatomical region varied (2468. 06mm^ ± 1813. lOmm^, 
73.46%, which represents individual subjects' regions were 12% ± 30% bigger 
than the average subject's; ratio of variance to mean is 26.49%). Another useful 
statistic is the ratio of interquartile range to median: larger values represent 
higher dispersion. T he values (Table [5]) are smaller than those determined by 
Fornito et aJ] (|2010l ). 



We projected this new parcellation template into each subject's surface. 
Using this projection procedure we kept topological consistency across all the 
subjects. We also included a few selected subcortical areas in the ROI list: 
Nucleus accumbens. Amygdala, Caudate, Hippocampus, Pallidum, Putamen, 
and Thalamus. Thus the actual number of ROIs (414, 813, and 1615, respec- 
tively) is slightly larger than specified. These three numbers of ROIs on the 
brain and resulting networks are shown in Fig. [2] Before further processing 
the diffusion tensor images, we serialised three diffusion tensor images and b- 
vectors for each participant; thus, a collated image of each subject has 30 bO- 
images and 180 diffusion images. The eddy current was corrected through FSL 



( http : //www ■ f mrib . ox . ac . uk/f si/) . We used Diffusion toolkit along with 
Track Vis (IWang et al. . 2007) with F iber Assignment by Continuous Tracking 



(FACT) algorithm ( JMori et al.l . Il999[) and 35 degree of angle threshold. After 



registering the surface meshes (FreeSurfer, http : //surfer . nmr . mgh . harvard . edu/ 1 



of each subject to his/her DTI, we generated cortical volume ROIs, which 
are voxels in the gray matter. Also the selected subcortical areas (see above) 
were registered to DTIs and used as ROIs. Then, we used the UMCP toolbox 
('http : //ccn . ucla . edu/wiki/index . php/UCLA_Multimodal_Connectivity_Package I 
to obtain connectivity matrices from the defined and registered ROIs and trac- 
tography. This counts the number of fibres from a tractography from the Dif- 
fusion toolkit, between all pairs of defined ROIs' grey matter; the fibre-counts 
between all ROI-pairs yield the weight-matrix. We also computed the average 
fibre lengths between ROIs (if there is no connection between a pair, the length 
is set to zero). The fibre length is based on the actual three-dimensional trajec- 
tory of the fibre tract and may be larger than the Euclidean distance between 
connected nodes. Note that long fibre length usually also corresponds to a large 
Euclidean distance between ROIs: earlier studies in fibre tracts in primates have 
shown that 85% of the fibr es go in a straight line or are only slightly curved 
( Hilgetag and Barbasll2006 ). However, only a fully bended (U-turn) fibre could 



result in a long fibre tract connection to spatially nearby nodes. 



DTI fibre-tracking yielded weighted matrices where weights indicate the 
probabihty of connections between corresponding ROIs. Non-zero weights have 
been interpreted as an indication for a hnk when constructing the brain net- 
works. The resulting adjacency matrices were similarly sparse for each network- 
resolution (Fig. [U average link-densities 7.5%, 4.0%, and 2.0% for networks of 
414, 813, and 1615 nodes, respectively). 



2.3 Network Analysis 

Single node-motifs wer e identified from th e constructed networks by applying a 
technique presented bv lCosta et al.l (2009 ) with ad ditions in order to choose pa- 
rameters automatically ([Echtermever et al.l . 120111 ) . The method yields a global 
characterisation of the network based on its local properties. Here, each node is 
quantified by 9 difi'erent local measures, which are motivated in the next para- 
graph. We chose such a large number of local measures, some of which share 
similarities, to ensure that as many network features as possible are considered. 
Redundancies, where two features are highly correlated, are removed during the 
node-motif detection, which is explained further below. 

A nodes' degree is its number of connections to other nodes; dividing this by 
the number of all links in the network yields the normalised node degree K. The 
average over all neighbours' degrees is called the normalised average degree r. 
(Nodes that are directly linked to each other are called neighbours.) Degrees 
of a node's immediate neighbours can further be taken into account by their 
respective coefficient of variation cv. Connectivity among neighbours of a node 
is quantified by the clustering coefficient cc, which r eflects how many of all pos- 
sible connections be tween neighbours actually exist (jWatts and Strogata . Il998t 
Kaiser et al.l . l2007al) . The hierarchical clustering coefficient of le vel two CC2 ex- 
tends this concept to connections between neighbours' neighbours ( Costa and Silval . 
20061 ). To what degree a node's neighbours connect to the same target is quan- 
tified by the localit y index lac, which is based on the matching index (e.g. 
Kaiser and Hilgetaei , l2004a ) . The 6 measures mentioned so far quantify topo- 
logical aspects of the network, while the next 3 measures also take into ac- 
count spatial features. The average connection length acl for a node is the 
average length of fibre tracts to its neighbours and the furthest away neigh- 
bour determines the maximum connection length mcl. The average over con- 
nection lengths to nodes two steps away, i.e. neighbours' neighbours, is called 
the average i ndirect reach air. Further details and measures can be found in 
the literatur e ( Albert and Barabasii l20Q2t iNewmanl . 120031 : iNewman et al.l . l2006t 



Costa et al.Ll2007bl) . 

The motif-detection workflow starts by applying the above listed network 
measures (Fig. |3|) : Local network measures are calculated for all nodes of the 
network, which yields a 9-dimensional characterisation (feature vector) for each 
node. This high-dimensional repr esentation is simplifl ed by exploiting simi- 



larities between different measures (ICosta et al.l . l2007bD : Using principal com- 
ponent analysis (PCA) ( Johnson and WichernL 120071 Chapter 8), correlation 



between measures is removed and the feature vectors are reduced to two dimen- 
sions [PCA-plane). The position on the PCA-plane characterises the nodes and 
allows for comparisons: Nodes close to each other share similar features whereas 
well separated ones differ. The next step is to estimate how likely speciflc combi- 
nations of features are. This is done by smoothing over points in the PCA-plane 



using the Parzen window approach (|Parzeni Il962l : iDuda et all . I2OOIL Chap 



ter 4.3) with width of the smoothing kernel scaled according to the standard 



devia tion along the corresponding principal component axis (Ec htermever et al. 



20111 ). Thereby, each node is assigned a probability, which is used to distinguish 



those with common and rare features. The w many nodes with uncommon 
properties, termed outlier nodes, were determined as follows. Given all nodes' 
probabilities p = (j>k)k=i.,....n (sorted increasingly), the respective mean p, and 
standard deviation cr(p), the number of singular nodes w is chosen as 

w = arg max pk+i ~ Pk , (1) 

k ■■ Pk< P~cr{p) 

or w — 0, if all probabilities are greater than p — cr(jj). Nodes corresponding 
to the smallest w probabilities are the identified outliers. Next, the outlier 
nodes were assigned to clusters each of which represents a different motif-group, 
i.e. a set of features. This was done by first centring equally sized ellipses on 
each point and determining their overlap. Maximal sets of nodes whose ellipses 
form a connected area yield both the number k of motif-groups as well as the 
groups themselves. (The above mentioned mecha nisms have been evaluat ed 
and discussed in detail in an earlier publication ( Echtermever et al.l , 120111 ).) 



Motif-groups correspond to 9-dimensional motif-regions in feature space, such 
that all nodes within a motif-group are similar with respect to the local network 
measures used in the beginning. In the final step, motif regions from all networks 
were reduced to the final 9 motif regions by subsequently joining the closest 
two of them. The pa irwise distances hav e been determined with a modified 
Mahalanobis measure ( MahalanobisL 119361 ). The number of nodes that express 



a particular motif yields a fingerpri nt of the n etwork, which characterises the 



network and allows for comparisons (jCosta et al. . 2009). e.g. by considering the 



motif-diversity, which is the number of unique motifs k that were identified. 
Another possibility is to compare the number of outlier nodes w. 

3 Results 

3.1 Motif Changes with Age 

Networks were first analysed with respect to age-dependent changes. Therefore, 
outlier motifs were determined for all networks to see whether certain motifs 
might only be expressed in subjects of similar age. This was not the case, but 
the total number of outlier-nodes w shows a characteristic pattern with age 
(Fig. m solid blue lines). Subjects of young ages (13-14 years) show 3-4 outher 
nodes on average — a number that is reduced throughout the teenager years and 
thereafter (15-20 years) until a sudden and predominant peak at the age of 21. 
Subjects aged 22-23 years have again fewer, but still several outlier nodes. This 
pattern is qualitatively preserved for different network resolutions (Fig. [¥] top 
to bottom). Quantitatively, the number of outlier nodes w rises with larger 
networks. 

To determine which changes are statistically significant a random permu- 
tation test has been applied: Subjects were randomly assigned an age (12- 
23 years) while ensuring that each age-group had the same size as for the original 
data. Subsequently, each group's mean w and k were compared to those of the 



original data to compute the P-values (using a total of 500,000 permutations). 
Results, shown directly in Fig. 21 represent how far each group is to the average 
over age. Note that except for the lowest network resolution, the large spike 
at age 21 does not lead to significant increase in w (and k). This is because 
our data only include one subject aged 21 and further data would be needed to 
confirm that the deviations at this age are indeed significant across all scales of 
resolution. 

A pattern that is very similar to that of the outlier number w is the diver- 
sity k of expressed motifs. The number k is high and low for nearly the same 
ages as was the number of outliers w before (Fig. |4l dashed orange lines), which 
is also reflected by the strong correlation between w and k (Pearson correla- 
tion 0.94, 0.88, and 0.87 for 414-, 813-, and 1615-node networks, respectively). 
Peaks at low (13-14 years) and high (21 years) ages can be seen with a dip 
in between. This pattern fits networks with 414 nodes best, whereas networks 
with the higher resolution (813 and 1615 nodes) show an increasingly flat pro- 
file except for the later peak (21 years). Note that absolute motif-diversity k 
is relatively similar for all network resolutions, which was not the case for the 
actual number of outliers w. 

Both, number of outliers w and motif-diversity k are interesting on their 
own, but their strong correlation provides additional information. Although the 
absolute values differ, similarity of the curves' profiles is independent of network 
resolution. We thus find the very robust effect that motif-diversity is coupled 
to the number of outlier nodes. In other words, whenever more outliers occur, 
they also become more diverse. 

3.2 Motif Changes with Network Resolution 

The previous section has already shown that network resolution affects outlier 
quantity, w, but not the relative distribution across the age-range (Fig. 21 solid 
blue lines). Different from that, motif-diversity — i.e. how many different mo- 
tifs are expressed, k — is nearly invariant on an absolute scale, in contrast to 
its relative distribution (Fig. HJ dashed orange lines). We tested whether the 
number w and diversity k of outliers, at different spatial scales, are correlated 
with cognitive skills of our subjects. Unfortunately, measures of intelligence 
yielded by four sub-scores of the Wechsler intelligence test were only available 
for 44 out of our 53 subjects. For all comparisons, the Pearson correlation was 
r < 0.5. Therefore, we cannot confirm a link between intelligence and w or k. 
However, intelligence might still be related to a specific combination of motifs; 
a possibility which could only be tested given a larger cohort of subjects. Next, 
we analyse the actual frequencies with which motifs are expressed by different 
age groups (Table [1]). 

Using the same raw DTI-data, networks have been constructed with different 
resolutions, i.e. differently sized ROIs, for each subject. More fine grained 
ROIs parcellate the brain into more elements, each of which is represented by a 
network node. We were interested in the effect that different network resolutions 
have on the occurring node-motifs (Table[3l Fig. [5]) and therefore, networks with 
414, 813, and 1615 nodes were analysed and compared to each other. 

The results show that motif-expression strongly depends on network-resolution 
(Fig. ini). Whereas motif 4 is most frequent for the 414- node resolution, it is 
nearly absent in the highest resolution of 1615 nodes. Other motifs also show 
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large fluctuations between spatial resolutions with either increasing or decreas- 
ing frequency. Motifs can also be characteristic for a single scale. For example, 
motifs 8, 9, and 10 are most prevalent across age groups at an intermediate 
resolution of 813 nodes. This shows that the spatial resolution is critical for as- 
sessing the frequency of motifs and, presumably, also for its underlying spatial 
and topological features. 

Motif counts also depend on age. For 414 node networks, many motifs 
only occur for one or two out of the four age groups. Whereas this might be 
influenced by the low number of nodes in the network, we also see age-based 
motif frequency changes for higher spatial resolutions. Looking at the resolution 
of 1615 nodes, the number of nodes showing motifs 3 and 5 increases with age 
whereas motifs 8 and 9 become less frequent. These age-dependent trends are 
less visible at 813-node resolution and they disappear for the 414-node networks. 

3.3 Fibre length distribution for different network sizes 

A possible explanation for the observed dependence on network-resolution could 
be fibre-length based network measures (namely: average connection length, 
maximum connection length, and average indirect reach). These measures take 
into account the determined fibre trajectory between ROIs, i.e. their fibre tract 
length. It is thus to be expected that the number of ROIs impacts on the corre- 
sponding fibre lengths. Wc tested this by observing the fibre length distribution 
for different network resolutions (Fig. [7]). Across all resolutions, short fibres are 
more frequent than lon g fibres. This expo nential tail, which can be fitted by 
a Gamma distribution ( Kaiser et al.L 120091) . ha, s been reported before not only 



at the level of fibre tracts (JKaiser and Hilgetaei l2004b), but also at the level of 



connections between neurons within cortical areas ( HellwieJ . 120001) . One notable 
difference between spatial resolutions is the increase in the number of short fi- 
bres (length<10mm) increasing by more than 50% from the lowest to the highest 
spatial resolution. This can be due to the smaller size of network nodes where 
surface regions that before belonged to the same node are now separate nodes on 
the surface. Therefore, a short fibre tract between these nodes becomes feasible. 
Looking at the maximum range of fibres, spatial resolution does not seem to 
influence maximum fibre lengths. However, differences in average fibre length, 
in particular due to an increased number of short fibre connections, might occur 
for particular nodes. The pattern for average indirect reach is more complex 
and can be influenced by changes in both short and long fibres. Overall, given 
a three-fold increase in spatial resolution, the fibre length distributions remain 
remarkably similar. 

3.4 Motif consistency across spatial scales 

How consistent are motifs across different resolutions? If a region splits into 
several regions for higher resolutions, do the daughter regions show the same 
motif as the parent region? Interestingly, this does not seem to be the case. Fig- 
ure [5] shows a typical example of motif locations for one subject. For 414 nodes, 
one location shows motif 3. One part of this location then shows motif 8 for 
the resolution of 813 nodes. For 1615 nodes, part of the original region shows 
motif 3 again, whereas another part shows motif 9. This is an interesting point 
when talking about the connectivity between brain regions: A pattern shown at 
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the low resolution level (e.g. cortical regions or Brodmann areas) may consist 
of several diverse patterns when observing connectivity at a higher resolution. 



4 Discussion 

We observed how the distribution (fingerprint) of characteristic motifs of sin- 
gle network nodes changed for human structural connectivity (diffusion tensor 
imaging) during brain development. We studied the role of both temporal scales 
(age groups from 12-23 years) and spatial scales (414, 813, and 1615 nodes) and 
found that, first, the number and diversity of motifs in a network are strongly 
correlated. Second, comparing different scales, the number and diversity of mo- 
tifs varied across the temporal (subject age) and spatial (network resolution) 
scale (with certain motifs only occurring at one spatial scale or for a certain age 
range). Third, the sub- regions of a node, using a higher spatial resolution, may 
or may not include the original motif at the lower resolution. Therefore, both 
the type and localisation of motifs differ for different spatial resolutions. This 
indicates that spatial resolution is crucial when comparing characteristic node 
fingerprints given by topological and spatial network features. This result is in 
line with p revious studies that obs erved the role of sampling on brain network 
properties ( Antiaueira et al.L 120101) and it additionally shows the infiucncc of 
the temporal scale. 

Networks can be characterized at different levels. Aggregate measures that 
characterize a network as a whole can be used to distinguish different net- 
work types, e.g. sm all-world ([Watts and Strogatz. 1998 : Sporns et al.', l2000t 



Iw 

edl 



Hilgetag et ani2000| ) scale -free feguiluz ct al.. 2005|: | Kaiser et a l.. 2007b), mod 



ular (JHilgetag et all . 120001) , or hierarchical (jKaiser et al.L |2010|) . Networks can 



also be characterized at the level of individual compon ents. Looking at th e topo- 



logical features of individual nodes, a first study by IPassingham et al.l (|2002l ) 



has linked the network features of macaque cortical areas with the function of 
each area. Whereas this approach considered all nodes of the network, there are 
also ways to search for 'special' nodes of a system. Following the ideas of scale- 
free networks, brain areas with a large number of connections — so- called networ k 
hubs — might be crucial for in tegrating or distributing information (jKaiser et al 



2007bl ). ISporns et al.l (|2007| ) were able to detect and classify different kinds of 
hubs of cortical structural connectivity. However, hubs are just one type of 
outliers where one or more node features (here: the number of connections of 
a node) differ from the average value of nodes in the netw ork. In two previous 
manuscripts (jCosta et al.l , 120091 : Echtermever et al.l I2OIII) , we have developed 
a tool that systematically searches for and classifies nodes which differ from 
the majority of nodes in a network. This approach includes the joint analy- 
sis of multip l e featu res, which include topological as well as spatial properties 
(jCostaet al.Ll2007al) . 

Our measures are based on outliers, which do not show the o verall trend as 



shown in previous studies ( s patial resolution dependent changes: Zaleskv et al.l_ 
(|2010l ): lBassett et al.l ([2010) :lForni to et al.l (l2010l):[ 



Havasaka and L aurien til (I201C 



age d e pendent change s: Fan et a l. (201(|); iHagmann et al.l (12010); 



(|2009l ): lUhlhaas et al.l (|2009l) . Table ISTj) . Thus, it's difficult to directly relate 
our results to their findings. However, our report that edge density decreased 
with increased spatial resolution is consistent with previous studies of struc- 
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tural connectivity ([Zaleskv et all I2OIOI : iBassett et al.l I2OIOI) . For the temporal 
resoluti on, most previous stud ies showed no overlap with the age range of our 
study. iHagmann et al.l ( 20101 ) performed a study with overlapping temporal 
scales, but only observed ages up to 18 years and thus only about half of the 
temporal scales considered in our study. Studie s with larger ove rlap focu ssed on 



2OIOI : lUhlhaa s et al. 



funct ional rather than structural connectivity (|Fan et al 
I2OO9I) . 

Here we noted that surface areas of used region-of-interest(ROI)s varied 
around 33%-43% compared to their mean (shown only for the cortical ROIs 
because subcortical ROIs were not subdivided. Table [2]). This variation stems 
from the variation of anatomical regions' surface areas (ratio of its standard 
deviation to its mean is 73.5%). The rati o of interquartile ra nge to median 
showed comparatively small ROI variation ( Fornito et al.l . uOlOf) . To keep the 
topological consistency, this might be the best we can do. At least, this is 
better than using just anatomical regions whose variation is much higher (73.5% 
variation) . 

This node fingerprint study is a proof of principle for the current technique 
showing the influence of spatial and temporal scales on network comparison. 
However, the current results could also be influenced by the following points. 
First, the number of subjects per age group was not identical (Tabled)). There- 
fore, findings in age groups with fewer members might have been influenced by 
relatively few outliers. Whereas an identical number of subjects per group is 
desirable, for our study there was also a trade-off with the age range: using the 
same subject numbers would have led to groups where the age ranges differ, 
e.g. covering 3 years for one group and 5 years for another. We decided for 
a different number of subjects per group in order to perform the comparison 
across the temporal scale. Second, single node motifs become more robust for 
larger network sizes. As node motifs are outliers, only a small percentage of 
all nodes will be characterized as node motifs. Therefore, for low-resolution 
networks, node motifs might only occur for few subjects. In addition, their 
number might be a poor estimate of the underlying frequency in human connec- 
tivity networks. For this reason, we have only reported networks with at least 
414 nodes, leaving out networks with a parcellation into 110 cortical and subcor- 
tical regions. Third, node features but not necessarily motifs depend on the edge 
density of the network. In our developmental networks, we observed significant 
edge density changes for different spatial resolutions ranging from more than 
7% for 414 nodes to 2% for 1615 nodes. However, for each network resolution, 
edge densities remained comparable for different temporal scales (age groups) 
(Fig.[T]). Fourth, deterministic tra cking cannot capture c rossing fibres which can 



be done i n probabilistic track i ng (iBehrens etal.1. 120071) . in Diffusion spectrum 



2010[). or in h i gh-an gular resolu- 



imaging (jWedeen et al.l . l2008t iBassett et ahL . 

tion diffusion imaging I'HARDL lTuchI (|2004h : Tzaleskv et aJ] (|2010l) ). However, 
fibre length distributions, influencing our three spatial features, were similar 
across resolutions (Fig. [7]). 

Network Science has led to a wide range of tools for analysing neural systems 



( Sporns et al. . 2004tlCosta et al.l . l2007btlBullmore and Spornsil2009l : lRubinov and Sporns 



2010l : lKaiseill201l[l . Whereas the characterization of individual networks, e.g. as 
small- world or scale-free, is now possible, comparing different networks is still a 
challenge. Our study shows how node properties, given by characteristic single- 
node motif fingerprints, can be compared between different networks. Studying 
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single node properties is only one way to characterize networks with alterna- 
tives being (a) the study of global properties including loca l and global efficiency 
( Latora and Marchiorii uOOlt lAchard and Bullmorel 120071 ) as well as modular- 



ity (JNewmanl . 120061 ). (b) the comparison of the cluster org anisation of different 



netw orks, and (c) direct comparison of network matrices (|Crofts and Higham . 



20091 ) ■ However, it is problematic for these measures that network features are 



influenced by the number of nodes and edges of a network; feature changes be- 
tween ne tworks might simply b e due to different edge densities in the compared 
systems ( van Wiik et all 120101) . For our networks, edge densities did not signifi- 



cantly differ across age but they differed across spatial resolutions. Whereas the 
underlying topological and, to some extent, spatial features might have changed 
in our networks, our motif fingerprints do not depend on the absolute values for 
network measures but on their distribution: an overall increase or decrease in 
the average value of a measure will have less of an influence on the number of 
node motifs as these depend on statistical outliers and not on absolute values 
of a measure. 

Nonetheless, nodes that present characteristic node motifs, as well as hubs, 
are rare within networks. These fingerprints can therefore only be applied for 
high-resolution networks (>414 nodes) and large groups of subjects. Fortu- 
nately, large subject cohorts are currently being recruited in several initiatives 
including the Human Connectome Proj ect for structural c onnectivity and the 
1000 Functional Connectome Project ( Biswal et al.l . 120101 ). In addition, fin- 



gerprints will also be useful for high-resolution networks at the sc ale of the 



micro -connectome observing connections between individual neurons (|DeFelipel . 



20101 ). Another potential application lies in the analysis of multi-electrode 



array recordings, whi ch nowadays can record from more than 4,000 channels 
( Sernaeor et al.l . l2010r) . We therefore made this tool available within the CAR- 



MEN initiative (http: //www. carmen. org .ukl for developing electrophysiology 



analysis tools ()Smith et al.l . 120011) . The tool is also available on our website 



( |http: //www. biological-networks . org/p . 

5 Conclusion 

In this study we found that (a) node motifs change over time and can there- 
fore be used to find characteristic changes in the developing brain and (b) mo- 
tifs change over spatial scales, therefore comparisons between studies are only 
meaningful when the same (or a comparable) spatial resolution has been used. 
Our results also indicate that spatial resolution has a higher effect on topolog- 
ical measures whereas spatial measures, based on fibre lengths, remain more 
comparable between resolutions. As node motifs are based on topological and 
spatial properties of brain connectivity networks, these conclusions are also rel- 
evant to other studies using network analysis. Another important aspect is the 
analysis of differences between healthy controls and subjects with brain disor- 
ders that can arise during brain network development, such as schizophrenia and 
epilepsy; we hope that our node fingerprint approach will be useful for detection 
and specification in these cases. 
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Tables 

Tabic 1: Ranges of agc-catcgories. 



12- 


-14 years 


9 


15- 


-17 years 


20 


18- 


-20 years 


16 


21- 


-23 years 


8 



category age-range number of subjects 

1 
2 
3 

4 



Table 2: Region of Interest (ROI) surface area (mni^) statistics with ratio of 
interquartile range to median. 

parcellation median interquartile range ratio 

05 
0.46 
0.52 
0.59 



Table 3: Properties of most frequent motifs. Symbols ■ • • , t; and ]. indicate 
normal, elevated, and decreased values of normalised node degree K, normalised 
average degree r, coefficient of variation of neighbours' degrees cv, locality 
index loc, clustering coefficient cc, hierarchical clustering coefficient of level 
two CC2, average connection length acl, maximum connection length mcl, aver- 
age indirect reach air, respectively. 
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Figures 
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Figure 1: Edge density for different spatial resolutions, given by the number 
of network nodes, and different age groups. Data points show the average edge 
density of all networks belonging to a certain age group and spatial resolution. 
Corresponding standard deviations are shown by error-bars. 
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Figure 2: Region of Interests (ROIs) on the brain (left) and the resulting 
networks (right) with different numbers of ROIs. 
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Figure 3: Figure from lEchtermever et al.l (J201ll) . Illustration of network- 
analysis work-flow: High dimensional characterisation of all network nodes 
through local network measures fii (Step 1) is compacted to two dimensions 
(Step 2) in order to estimate a probability distribution (Step 3), which is used 
to identify nodes with miconrmon features (Step 4). All nodes are grouped 
(Step 5) to form high dimensional motif- regions (Step 6), which are eventu- 
ally joined, if too close to each other (Step 7). The number of nodes in each 
motif- region yields a fingerprint of the network (Step 8). 
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Figure 4: Motif-expression changing with age: Both the number of outher nodes 
in a network w (sohd blue Une) and the diversity of motifs expressed k (dashed 
orange hne) vary with subject age. The time-dependent patterns of w and k are 
shown for different network resolutions (rows). Age-groups indicated by dashed 
vertical lines. Significantly de- or increased values for w and fc are indicated 
by symbols < and >, respectively (* 90%, ** 95%, and *** 99% significance). 
Note that our data only include one subject aged 21 and further data would be 
needed to confirm significance of deviations at this age. 
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motif 3 



motif 4 





motif 5 



motif 8 





motif 9 



motif 10 





Figure 5: Stylised illustration of most frequent motifs (Table [S]): mo- 
tif 3 Provincial node with connections in the direct (high cc) but not the larger 
neighbourhood (low cc2, air, and met), motif 4 Provincial huh with more con- 
nections than its neighbours (high r) that are less connected between themselves 
(low ee). motif 5 Global bottleneck with few (low K) but long-range connec- 
tions (high acl, mcl, air) that reach beyond the local neighbourhood (low cc). 
motif 8 Global uniform bottleneck sharing properties of motif 5 but connected 
to nodes with similar degrees (low cv). motif 9 Global local bottleneck shar- 
ing properties of motif 5 but also having well-connected neighbours (high cc) 
thus better informing local circuits, motif 10 Provincial bottleneck with only 
short-range connections (low mcl) and few connections between its neighbours 
(low cc) leading to a large influence on the local circuit. 
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Figure 6: Motif-expression changing with network-resolution. Plots show dis- 
tribution of outlier nodes among motifs 1-5 and 7-10. Motif 6 (not shown) 
corresponds to the remaining 98% network nodes with common features (regu- 
lar nodes). 
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Figure 7: Fibre length distribution, as the length of the trajectory in mm, 
for different network resolutions. Relative frequencies for low (414 nodes, top), 
medium (813 nodes, middle), and high (1615 nodes, bottom) spatial resolution. 
Note that longer fibres (> 200mm) occurred so infrequently that corresponding 
bars (not shown) would be invisible. 
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Figure 8: Example for the motif distribution in one subject (19 years old) for 
different spatial resolutions. Different number of regions of interests (ROIs, left: 
414, middle: 813, and right: 1615) with different views: left lateral view, left 
superior view, right superior view, and right lateral view in order from the top. 
yellow: motif 3, magenta: motif 4, cyan: motif 7, red: motif 8, green: motif 9, 
blue: motif 10 (motifs 1, 2, and 5 were not present). 
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Table SI; Comparison of our findings with previous studies (SC: structural 
connectivity, rsFC: resting state functional connectivity, m/o: months-old, 
y/o: years-old, ED: edge density, CC: clustering coefficient, CPL: character- 
istic path length, ADC: apparent diffusion coefficient, FA: fractional anisotropy, 
7: CC/CC,and' A: CPL/CPL,^j,d)- 



study 



findings 



Spatial scale: 

Ours 

Zaleskv et aJ] (|2010f ) 



Bassett et all (|2010l) 



Hagmann et al 



(2010) 



Fornito et aJ] (|2010l) 



Havasaka and Laurient I 
(l20inl) 



As the number of nodes increases, 

in SC (414, 813, and 1615 nodes), ; ED, t w (See Fig.Sl) 

in SC (6 different scales between 82 and 4000 nodes), 

4, ED, t small worldness, t 7i 4- global efficiency, changes 

in nodal rank degree and betweeness centrality 

in SC (12 different scales between 54 to 880), 4, ED, t 7, 

t A, conserved hierarchy, t Rentinan scaling 

in SC (66 and 241 nodes), 4, ED, t CC, ; Efficiency, ; Node 

strength 

in rsFC (7 different scales between 84 and 4320 nodes), 

4- average correlation, t size of largest component, I path 

length, t CC (for low ED), i CC (for high ED) t small 

worldness, changes in nodal rank degree 

in rsFC (voxel-based nodes with 3 different voxel sizes and 

region-based nodes), changes in node degree distribution 

t CC, t path length (while increasing voxel sizes, but it 

decreased with region based node) 



Temporal scale: 

Ours 



Hagmann et al. 
(I2OIOI) 



Fan et alj (|2010l) 



Fair et alj (|2009l) 



Uhlhaas et al. (1200 



As the age increases, 

in SC (12 to 23 y/o), a characteristic pattern over spatial 
scales (See Fig. S]) 

in SC (18 m/o to 18 y/o), 4- rnean ADC, t mean FA, 
4. ED, ; CC, t Efficiency, t Node strength, t SC-FC cor- 
relation, no changes in modularity and no major changes 
in module composition after 2 y/o, no significant changes 
in betweeness-centrality 

in SC (1 m/o, 1 y/o, 2 y/o, and adult), '[ global effi- 
ciency, t cost efficiency, a peak at 2 y/o in local efficiency 
and modularity, 'I size of largest component in modules, 
changes in module assignment and participation coeffi- 
cient 

in rsFC (7 to 31 y/o), no significant changes in optimized 
modularity Q, CC and CPL. Changed module assignment 
and the number of long-distance correlations f- 
in EEG (6 to 21 y/o), strong correlation between neural 
synchrony and cognitive performance. As ages increase, 
the neural synchrony was increasing in early adolescence, 
then decreasing in late adolescence, and finally increasing 
again in adult 
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